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Abstract 

Tracker detectors can be used to identify charged particles based on their global x value obtained during track 
fitting with the Kalman filter. This approach builds upon the knowledge of detector material and local position 
resolution, using the known physics of multiple scattering and energy loss. The proposed method is independent 
of the traditional way of identification using deposited energy. The performance for present LHC experiments is 
demonstrated. 
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1. Introduction 

The momentum of a charged particle can be measured by examining the small angle scatters of the trajectory 
during propagation through the detector medium or tracker layers. For a recent application see Ref. [ 1 ] where the root 
mean square of the scattering angle distribution is computed for each track and compared to the theoretical estimate 
which is proportional to 1 //?/?. By assuming particle type, or at high momentum (J3 ~ 1), p can be estimated. This 
classical method underestimates momentum since the particle loses energy and its momentum decreases. 

The Kalman filter is widely used in present particle physics experiments for charged track and vertex fitting and 
provides a coherent framework to handle known physical effects and measurement uncertainties |2||. It is equivalent 
to a global linear least-squares fit which takes into account all correlations coming from process noise. It is the 
optimum solution since it minimizes the mean square estimation error. Recent studies show that this technique can be 
successfully used to improve momentum resolution of particles, even in experiments without magnetic field [3]. It is 
possible via the effects of multiple scattering. If the detector is in magnetic field, the momentum of charged particles 
can be obtained from the bending of the trajectory. Hence track fitting may provide additional information that could 
constrain the velocity of the particle, thus contributing to particle separation or identification. 

This article is organized as follows: Sec. [2] introduces the merit function of a track fit^ and discusses its charac- 
teristics. Sec.[3]deals with physical effects during track propagation, while in Sec. [4] the basic scaling properties of^ 
are given. In Sec. [5] the details of the Monte Carlo simulation and the obtained performance are shown. This work 
ends with conclusions and it is supplemented by two Appendices. 

2. The merit function of the fitted track 

There are various merit functions that can characterize the goodness of a track fit: sum of the squared and properly 
normalized predicted (P), filtered (F ) or smoothed (S) residuals. It can be easily shown that for each hit^p = x%- 
The filtered residuals are uncorrected and in the Gaussian case independent. Hence YuX\ is chi-square distributed 
with r = [J]k dim(mfc)] - n p degrees of freedom, where dhn(m^) is the dimension of the kth hit on track and n p is the 
number of track parameters. 
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Tests with smoothed residuals (e.g. for outlier removal) appear to be more powerful J2l, but the correlations of 
these residuals between the states have to be taken into account. Their global covariance matrix Hki between smoothed 
states k and I can be calculated |4] with the recursion 



Ct u =A k - X Cl p k<l 

and 

%d = Vkdki - HkC n k l Hj 

where C is the smoothed covariance matrix, A is the gain matrix, V is the covariance of measurement noise, H is the 
measurement projection matrix. Here we follow the notations of Refs. J2, Hi- The vector of smoothed residuals is 
described by a multivariate Gaussian distribution with the global covariance K obtained above. Since Kalman filtering 
consists of a series of linear transformations, the smoothed residuals can be obtained from the predicted ones by a 
linear transformation r$ = Brp. Note that no translation is allowed, since the average of both residuals is zero. The 
global covariance matrix of predicted residuals is Hp, the covariance for r$ is Us = B%pB T . Thus, the expression for 
the corrected sum of smoothed values is 

= 'W* = (Bprp) T (Bp<R P B T p r l (Bprp) = 

= (r T P B T P )(B T p - 1 <R P l B P l )(Bprp) = r T P <R P l r P = ^Xp- 

It is clear that the correlations are transformed out and we get back simply the predicted or filtered values: YiXp = 
YuX\ - ^ZiX\y ■ Hence the most straightforward quantity to calculate is the sum^- 2 = YuXp using predicted residuals 
which will be used in the remaining part of this study. 

During track propagation the mass of the tracked particle has to be assumed. In collider experiments it is often set 
to the mass of the most abundantly produced particle, the pion, or that of the muon. The obtained merit function with 
mass assumption mo is 

X 1 (m ) = J]rlR- k 1 r i 

k 

where the index k runs for all the measurements and is the local covariance matrix for the Mi measurement. If the 
largest contributions to /?* are independent in r<p and z directions, % 2 can written as 

V \ cr,(mo) I 

Zi (Ti(m) \ 2 / x ; - fii(m )\ 2 ^ 

where i runs for all split measurements and <r, are the corresponding standard deviations. The resulted sum is a linear 
combination of non-centrally chi-square distributed independent random variables z, with weights a,. The distribution 
functions are fx(Zi', where 

_ / cr,(m) \ 2 _ / j u,(m)- j u,(m ) \ 2 

\o-,(m )/ ' ' \ o-i(m) J 

The sum in Eq. (Q3 can approximated by a single rescaled non-central chi-squared distribution l/a 2 fx(x/a 2 ; r, A 2 ) 
such that 

2 I>? (I>/) 2 ,2 V J 

a = — , r = z n„, A = ) Aj 
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Figure 1: The contributions to deviations and shifts of the predicted hit in a B = 3.8 T magnetic field, after crossing x/Xq = 2% silicon and 
further / = 5 cm propagation before reaching the next layer, as a function of particle momentum. Left: expected standard deviations due multiple 
scattering. Right: expected shifts, compared to an average propagation with n mass assumption, due to energy loss. The curves give the limits of 
the lower and upper ±l<x confidence intervals for several particle types. For comparison lines corresponding to a local position resolution of 25 /jm 
are drawn. 



where n p is the number of track parameters. For details see AppendixlBl If m = mo, we get a,- = 1, or = 1, A = 0, and 
the distribution is a chi-squared one. If the ratio of expected variances a, are similar for all i, we get 



[[o-i(m )J ) ^ 

and r is the number of split measurements decreased by the number of track parameters. 

At the same time the use of the variable x = V? appears to be more practical. It is described by a scaled 
non-central chi-distribution 1 /a fix I a; r, A) and well approximated by a Gaussian with parameters 



For detailed derivation see AppendixlAl 

The value of x can be calculated for each track during the track fit with Kalman filter. For different type of 
particles it will have different distribution function, because the parameters fi x and cr x (via a and r) depend on the 
ratio of expected hit deviations cr,(m)/cr,(mo) which are mass dependent (see Sec. [3). This observation allows to use 
this quantity in particle identification. Using the Gaussian approximation of Eq. (O, the separation power p x of x 
between particles of mass m\ and mi is 

2[fi x (mi) - fi x (m 2 )] 
Px = i ^= (4) 



3. Physical effects 

When a stable charged particle propagates through material the most important effects which alter its momentum 
vector are multiple scattering (ms) and energy loss {el). In the following the expected spatial shift 5 and deviation <x 
will be calculated. They are to be compared with the resolution of the local position measurement cr pos of the tracker 
layers. 
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The distribution of multiple Coulomb scattering is roughly Gaussian ||5|], the standard deviation of the planar 
scattering angle is 

Oo = U - 6 MeY z ^pT [l + 0.038 Hx/Xo)] (5) 
Pep 

where p, fie, and z are the momentum, velocity, and charge of the particle in electron charge units, and x/Xq is the 
thickness of the scattering material in radiation lengths. While the expected shift is 5 ms = 0, the average deviation on 
the next tracker plane after a flight path /, in case of normal incidence, is 

<r ms ~ / 8q. (6) 

Momentum and energy is lost during traversal of sensitive detector layers and support structures. To a good 
approximation the most probable energy loss A p , and the full width of the energy loss distribution at half maximum 
T A 10 are 



2mc 2 B 2 y 2 £ , 

In + 0.2000 -p 2 - 5 

I z 



(7) 



T A = 4.018^ (8) 

where 

K o Z x 
* = 2 Z ~A P J 

is the Landau parameter; K = 4wNAr%tn e c 2 ; Z, A and p are the mass number, atomic number and the density of the 
material, respectively [Q]. Since this study deals with momenta below 2 GeV/c, the density correction 6 was neglected. 

In most cases tracker detectors are placed in magnetic field (B). Given the radius of the trajectory r and the length 
of the arc /, the central angle is <p = l/r. If the radius is changed by 6r, the angle changes by dtp = -II r 2 6r and the 
position shift of the trajectory after I path is 

6 et ~ I 8<p/2 = -I 2 12 6r/r 2 
At the same time p = Q3Br, EdE = pdp. Hence 

0.3B1 2 <A> 



S, 



el 



2 f3p* 

Similarly, the expected deviation is 

0.3B1 2 o-A 

CT e l 



2 ppt- 

The contributions to deviations and shifts of the predicted hit in a B — 3.8 T magnetic field, after crossing x/Xq = 
2% silicon and further / = 5 cm propagation before reaching the next layer, are shown in Fig. Q] Standard deviations 
are dominated by multiple scattering, although at very low momentum the energy loss, at very high momentum the 
local position measurement also plays a role. Shifts from energy loss are only relevant at very low momentum, but 
they are still very small compared to standard deviations. 



4. Properties of \ 

It is important to study the sensitivity of the measured^ distribution at a given total momentum p. The parameters 
which govern the distribution (Eq. 0}) are the rescaler a, the average shift A and the number of degrees of freedom r. 
In this section we estimate them, as well as the separation power p x listed in Eq. Q, based on physical effects. 

Since the deviations are dominated by multiple scattering and local position measurement, a in Eq. (f2]i can ap- 
proximated as 



\ 

\ \ 

\ \ 

\ 
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Figure 2: The average shift for several particle types, in a B = 3.8 T magnetic field, with layer thicknesses of x/Xq = 2% silicon, an average 
propagation length of / = 5 cm, in case of r =16 number of degrees of freedom, as a function of the particle momentum. 



which can be further simplified, if cr pos <k cr ms , to 



y3(m ) 

a ~ 

B(m) 



2 \' pHm ) 

where the sensitivity is defined as £ = <T pos /o- ms (mo), it is proportional to l/Bp (Eqs. ©-(O). If the local position 
resolution can be neglected <k 1) we get 

a ss . (9) 

B(m) 

Shifts come entirely from differences in energy loss, hence contributions to A are only substantial at low momen- 
tum: 



r Q3Bl 2 (A(m) - A(m )> 
yr 



lyfFli 1 1 



p \/3 2 (m) /3 2 (m )J' 

The average shift (A) in a B = 3.8 T magnetic field, with layer thicknesses of x/Xq - 2% silicon, an average 
propagation length of I = 5 cm, in case of r — 16 number of degrees of freedom, is shown in Fig. [2] 
If A, ( <K 1, the separation power p x between particles m and mo is 

ft »2V5^i izmm do) 

Vl + [B(m)/B(m )] 2 

Hence if the momentum is not very low and the local position resolution is small compared to deviations from multiple 
scattering, neither the rescaler a nor the separation power p x depends on the details of the experimental setup, such 
as magnetic field, radii of tracker layers, value of local position resolution and material thickness. In this respect the 
only decisive parameter is the number of split measurements which enters the above expressions by the number of 
degrees of freedom r. The mean and variance of the corresponding Gaussians are fully determined by the momentum 
and mass of the particles via B. 

Although at low momentum the prediction of the means is more difficult due to the increasing A, the variances 
still stay the same. The x distribution can be easily unfolded, since the separation power is large, allowing for a 
many -parameter fit. 
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Table 1: Important characteristics of the inner barrel detectors of the studied experimental setup. For details see text at the beginning of Sec. [5] 
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4.1. Applications 

The measured value of \ is sensitive to the proper spatial alignment of the detector layers and to the correct 
estimate of the variation of the predicted local position. If the alignment precision is sufficient, the latter is mostly 
determined by the contribution from multiple scattering which is closely proportional to -\/x/Xq. While p and r are 
well measured, the amount of material in the detector can be 

• understood: the unfolding of the x distribution in a phase space bin enables the measurement of yields of 
different particle species. 

• poorly known: the unfolding of the x distribution in a phase space bin may provide corrections to the mate- 
rial thickness. They can be extracted by fitting the x distribution with an additional rescaler. Note that the 
measurement of yields of different particle species is still possible, although with lower confidence. 

5. Simulation 

The proposed method was verified by a Monte Carlo simulation. As examples from LHC, the performance of 
simplified models for the inner detectors of the following experiments were studied: 

• ATLAS (Exp A): three layers of silicon pixels, five layers of double-sided silicon strips, up to 36 layers of straw 
tubes 01. 

• ALICE (Exp B): two layers of silicon pixels, two layers of silicon drifts and two layers of double-sided silicon 
strips [8, 9]. Due to the large £ value of the gas detector (TPC) its measurements were not included. 

• CMS (Exp C): three layers of silicon pixels, ten layers of silicon strips (four of them double-sided) fioh . 

Some relevant details of the experimental setups are given in Table Q] For simplicity a homogeneous longitudinal 
magnetic field was used, and detector layers were assumed to be concentric cylinders around the beam-line. Pixels, 
double-sided strips (superscript ! ), drift layers and gas provide measurements in two dimensions (n/> and z), while 
one-sided strips and straw tubes give only measurement in one direction (n/>). x/Xq values are given per layer and 
they are rounded to integers where possible. Sensitivity values and f z are shown for pions at p = 1 GeV/c, normal 
incidence, rounded to one significant digit. The number of split measurements are also indicated. 

The initial state vector was estimated by fitting a helix to the first three hits. (These hits are two-dimensional 
in all three examined experimental setups.) The starting values of the track parameters were extracted at the closest 
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Figure 3: Distributions of x for several particle species. The relative yield of particles was set to n : K : p : e = 70 : 10 : 18 : 2. Results are shown 
for T] = 0,pr = 0.4 GeV/c (upper row) and r\ = 0,pr = 0.8 GeV/c (lower row) with setups Exp A, B and C. Individual fits with chi distributions 
are indicated by thin solid lines. 



approach to the beam line. The track fitting was performed by a classical Kalman filter Q with pion mass assumption. 
The state vector x - (k, 9, \p, rc/>, z) is five dimensional, where 



k = q/p 

e = e(p) 

rip — r<p(r) 
Z = r L 



(signed inverse momentum) 
(local polar angle) 
(local azimuthal angle) 
(global azimuthal position) 
(global longitudinal position). 



The propagation from layer to layer was calculated analytically using a helix model. Multiple scattering and energy 
loss in tracker layers was implemented with their Gaussian approximations shown in Eqs. (O-©. The propagation 
matrix F = df/dx was obtained by numerical derivation. The measurement vector m = (r<p, z) is two dimensional, the 
measurement operator is 

(0 1 0\ 
1 " 



H 



The covariance of the process noise Q is 

Q = {F K 



. Fl)^ + (F e 9 FDo* + (f> ® Fl)cA 



where cr K = Kcr^/ft, ctq-ct^- 6q and F a = df/dx a is a vector. The covariance of measurement noise V is 

Note that multiple scattering contributes equally to the variation of 6 and i/r, while energy loss affects only k. 
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5.7. Results 



In order study the performance of x, charged pions, kaons, protons and electrons with random azimuthal angle 
were generated and emitted normal to the line of the colliding beams (77 = 0) and run through the above outlined 
reconstruction. 

Distributions of x using 10 5 particle tracks are shown in Fig.[3]for pj = 0.4 and 0.8 GeV/c. For a realistic particle 
composition the relative yields were set to ji : K : p : e = 70 : 10 : 18 : 2. At pj — 0.4 GeV/c, in case of Exp A, the 
protons are detached, but there is a good 71-p separation for Exp B and C, as well. For Exp A and C the x-K separation 
allows for yield estimation. Even at pj = 0.8 GeV/c the observed resolution is enough to extract the protons. When 
fitting the histograms a sum of chi distributions was employed (thin solid lines), but a sum of Gaussians may also be 
sufficient. 

For a complete picture charged pions, kaons, protons and electrons with transverse momenta pj = 0.2, 0.4, 0.6, 
. . . , 2.0 GeV/c and r\ — were used, amounting to 10 4 particles per pj setting for each particle type and experimental 
setup. The performance oi x as function of p for all three setups is shown in Fig. [4] The subsequent rows give the 
dependence of the measured rescaler a, the fitted number of degrees of freedom r, the merit function of the histogram 
fit with sum of chi distributions and the separation power p x . This latter was calculated by using the measured 



a and r values with help of Eqs. — ©. The measured values are shown by the symbols. In case of a the line gives 
the plain f3{mo) I f3{m) scaling (Eq. (|9]l) that works rather well for all three setups and for all particle types. For r the 
horizontal lines show the number of split measurements for a given pr, decreased by the number of track parameters 
n p . While these predictions are closely followed by the measured values in case of Exp C, there are substantial 
deviations with the other two setups. It can be traced back to low sensitivity measurements: large number of straw 
tubes with ^ = 10 (Exp A), and two strip layers with ( z = 7 (Exp B). In case of the separation power p x the lines 
show the approximation based on the predicted number of degrees of freedom and the ratio f5(m)//3(mo), calculated 
with help of Eq. (II Oi l. The steps are due to the changing number of crossed detector layers with varying p. The 
approximation works well for Exp C, but strongly overestimates the measured value for Exp A. It is again due to the 
large number of low sensitivity measurements. 

Comparison of the n-p separation power of the % measurement for several experimental setups as a function of 
momentum is shown in Fig. While Exp A clearly performs better for p < 0.6 GeV/c, Exp C has better resolution 
for the more critical higher momentum region. With the most sensitive setups (Exp A and C) protons are lcr apart if 
p < 1.4 GeV/c, while 2<x separation is reached if p < 1 GeV/c. For kaons these numbers are p < 0.9 and 0.5 GeV/c, 
respectively. 

6. Conclusions 

It was shown that tracker detectors can employed to identify charged particles based on their global x obtained 
during track fitting with the Kalman filter. This approach builds upon the knowledge of detector material and local 
position resolution, using the known physics of multiple scattering and energy loss. The study using simplified models 
of present LHC experiment shows that n-K and n-p unfolding is possible at low momentum. The separation is better 
than lcr for p < 0.9 and 1.4 GeV/c, respectively. In general, the performance of an experiment is determined by the 
number of good sensitivity split measurements. It is also a strong function of particle momentum. 

If particles can be identified based on informations from other sources (e.g. independent AE/dx measurement) 
this tool can still be useful to provide corrections to the amount of material in the detector and to check the obtained 
precision of its alignment. 
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A. Properties of some distributions 




In this section the definitions of some used distributions are listed along with their calculated or approximated 
values for the mean p and variance a 2 . 
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Figure 4: Performance ofx measurement for particle identification, with setups Exp A, B and C. For details see text in Sec. 15. II 
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Figure 5 : The n— p separation power of the x measurement for the experimental setups, as a function of momentum. The lines are drawn to guide 
the eye. 



A.l. x 1 distribution 

The distribution, mean and variance are 



P(x;r) = 
fi = r, 



x r/2-l e -x/2 

cr 2 =2r. 



A.2. Non-central x 1 distribution 

The distribution, mean and variance are 

P(x; r, A) = 



-(x+A)/2 x (r-V)/2 



2(Ax) r ' 4 
H = r + A, o 2 = 2(r + 2A) 



I r /2-i ( V^) 



where I„(x) is the modified Bessel function of the first kind. 

A.3. x distribution 

The distribution and mean are 



^ = 



P(x; r) = 
V2T(r±l) 



2l-n/2 x n-\ e -x 2 /2 



r (i) 



r(5) 



= V7 



r 4 



where Ref. [11] for r » 1 was used. The variance is 



(11) 
(12) 



cr = r — u. sa —, 
P 2 



(13) 
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A.4. Non-central % distribution 
The distribution and mean are 



-(x 2 +A 2 )/2 x r A 

P(x;r,A) = --^r^ Ir/2-i(*x) 



(AxY' 2 



-A' 



where L { ,"\x) is the generalized Laguerre function. For r » 1, with Kummer's second formula Ill2ll 



r(*M§) 



1 r -A 2 
1jFi| "2'2'~ 



where \F\(a,b, z) is the confluent hypergeometric function of the first kind. With help of Eq. (TT2l) and Ref. (|l_3| 
assuming A 2 <k r 



, ^ 2 ,M 
1 + 



For the variance 



a 2 = r-/ J 2 +A 2 * -. 



, l-2i 2 /l 



r-- + ^ 2 . (14) 



(15) 



Note that with A - Owe get back the mean of the;f distribution (Eqs. (TT2l and (TBI) ), while the variances are the 
same in the central and non-central case (Eqs. ( TT3l > and (TT5b). 



B. Sum of non-central chi-squared distributed independent variables 

The goal is to approximate the sum 

y = J] UiZi 

where z, are non-central chi-squared distributed independent random variables with one degree of freedom and density 
function fxizc, l,Aj). Although an explicit expression for the distribution of y exists, it is difficult to evaluate in 
practice 11411 . Here this function is approximated by a rescaled non-central chi-squared distribution l/afx(x/a; r, A) 
by requiring that the first two moments be the same. The means and variances are additive, thus the equations two 
solve are 

(y)= ^]fl,(l+ A,-)= a(r + A) 

i 

<(v - (y)) 2 } = 2 J] a 2 (l + 2Ad = 2a\r + 2A) 

By assuming A, <K 1 we get 

!>• 

a = — — , 

Zi at 

with relative corrections of the order 0(A 2 /r 2 ). 
approximated by 



Oi) 2 

r — 



-s- 



Si"? ' 

If the values of a, are similar some of the above expressions can be 



a « (a,), r ~ n. 
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